Split pragmatics into presuppositions and scalar implicatures #2938

raileymontalan · 2024-08-16T09:13:38Z

No description provided.

raileymontalan · 2024-09-06T14:45:39Z

Hi @weiqipedia, for your info.

yifanmai

Looks good overall. Note that schema_bhasa.yaml also needs to be updated to reflect these changes (this can be done in a separate pull request).


yifanmai · 2024-09-09T22:35:01Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+                )
+                # Split "True or False" into ["True", "or", "False"]
+                choices = row["choices"].split()
+                choices_translated = row["choices_translated"].split()


Does this work consistently across every (supported) language?

That's a good question! For now we only have Indonesian (and Tamil), and splitting on whitespace and taking the first and third elements of the list works for both languages. Just FYI, this will not work for Thai, which does not put spaces between words; there we would have to use something closer to your suggestion of splitting on " or " (but we will not be adding Thai any time soon).
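The two splitting strategies discussed in this thread can be sketched as follows. This is a minimal illustration, not the code in the PR: the helper names `split_choices_by_whitespace` and `split_choices_by_separator` are hypothetical, and the separator to pass for any given language is an assumption that would need to be checked against the dataset.

```python
from typing import Tuple


def split_choices_by_whitespace(choices: str) -> Tuple[str, str]:
    """Hypothetical helper: split "True or False" into ("True", "False").

    Only valid when the string is exactly three whitespace-separated tokens.
    """
    tokens = choices.split()
    if len(tokens) != 3:
        raise ValueError(f"Expected 3 whitespace-separated tokens, got {tokens!r}")
    # The first and third tokens are the two answer choices.
    return tokens[0], tokens[2]


def split_choices_by_separator(choices: str, separator: str) -> Tuple[str, str]:
    """Hypothetical helper: split on an explicit connective instead of whitespace.

    This does not depend on spaces between words, so it can handle
    scripts that write the choices without them.
    """
    first, found, second = choices.partition(separator)
    first, second = first.strip(), second.strip()
    if not found or not first or not second:
        raise ValueError(f"Could not split {choices!r} on {separator!r}")
    return first, second


print(split_choices_by_whitespace("True or False"))        # ('True', 'False')
print(split_choices_by_separator("True or False", " or "))  # ('True', 'False')
```

The separator-based variant trades generality for one extra piece of per-language configuration: each language would need its own connective string.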

src/helm/benchmark/scenarios/bhasa_scenario.py

run_eval.sh

src/helm/benchmark/run_specs/bhasa_run_specs.py

yifanmai · 2024-09-24T20:57:47Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+        if self.language not in self.prompts.keys():
+            raise (Exception(f"Unsupported language {self.language} - supported languages are {self.prompts.keys()}"))
+        else:
+            self.prompt_components = self.prompts[self.language]

    def download_dataset(self, output_path: str):
        BASE_URL = "https://raw.githubusercontent.com/aisingapore/BHASA/main/lindsea/"


Optional: You can pin this to a specific commit hash so that future changes to the repository won't cause this scenario to change, e.g.:

BASE_URL = "https://raw.githubusercontent.com/aisingapore/BHASA/10e34008e8142bef400cf8ffab15b2b6aaf3aa7f/lindsea/"
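One way to make the pinning explicit is to build the base URL from its parts and reject anything that is not a full commit hash. This is only a sketch of the idea in the review comment: the helper `pinned_base_url` and the constant `RAW_HOST` are hypothetical names, and the 40-character check assumes a full SHA-1 commit hash.

```python
import re

RAW_HOST = "https://raw.githubusercontent.com"


def pinned_base_url(repo: str, commit: str, subdir: str) -> str:
    """Hypothetical helper: build a raw-content URL pinned to one commit."""
    # Assumption: a full 40-character lowercase hex SHA-1, not a branch name.
    if not re.fullmatch(r"[0-9a-f]{40}", commit):
        raise ValueError(f"Expected a full commit hash, got {commit!r}")
    return f"{RAW_HOST}/{repo}/{commit}/{subdir.strip('/')}/"


BASE_URL = pinned_base_url(
    "aisingapore/BHASA",
    "10e34008e8142bef400cf8ffab15b2b6aaf3aa7f",
    "lindsea",
)
print(BASE_URL)
```

Passing a branch name such as "main" raises a ValueError, which guards against accidentally un-pinning the URL later.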


yifanmai

LGTM, thanks!

yifanmai · 2024-10-01T22:38:28Z

src/helm/benchmark/scenarios/bhasa_scenario.py

-        dataset = pd.read_json(target_path_file, lines=True)
+        datasets = []
+        for subset in self.subsets:
+            URL = f"{BASE_URL}{self.language}/pragmatics/pragmatic_reasoning_{subset}.jsonl"


nit: URL should be lowercase (it is not a constant)
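The naming point can be illustrated with a small sketch: upper case for the fixed module-level constant, lower case for the value rebuilt on each loop iteration. The function name `build_subset_urls`, the language code "id", and the subset names below are assumptions made for illustration and are not taken from the PR.

```python
from typing import List

# Module-level constant: upper case by convention.
BASE_URL = "https://raw.githubusercontent.com/aisingapore/BHASA/main/lindsea/"


def build_subset_urls(language: str, subsets: List[str]) -> List[str]:
    """Hypothetical helper: one download URL per pragmatics subset."""
    urls = []
    for subset in subsets:
        # Local variable reassigned on every iteration: lower case.
        url = f"{BASE_URL}{language}/pragmatics/pragmatic_reasoning_{subset}.jsonl"
        urls.append(url)
    return urls


# Assumed language code and subset names, for illustration only.
for url in build_subset_urls("id", ["presuppositions", "scalar_implicatures"]):
    print(url)
```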

raileymontalan added 4 commits July 1, 2024 02:54

Add LINDSEA pragmatics subset

999bdec

Split pragmatics

fdad90d

Split pragmatics into pressupositions and scalar implicatures

44b9a04

Update run entries for pragmatics

e903728

raileymontalan marked this pull request as draft August 16, 2024 09:14

raileymontalan added 9 commits August 17, 2024 02:48

Fix formatting

2e70dfe

Rerun unit tests

a27b9e8

Merge branch 'stanford-crfm:main' into lindsea_pragmatics_scenario_split

de472e6

Enforce input typing

14b31f5

Enforce input typing

4b14707

Remove line

8648ac5

Fix type checks

7ac4866

Fix line error

8e84dd2

Simplify dict

6bad749

raileymontalan marked this pull request as ready for review September 6, 2024 14:44

yifanmai reviewed Sep 9, 2024


raileymontalan added 4 commits September 12, 2024 03:45

Update BHASA schema, add exception for unsupported languages for LINDSEA

86ad100

Add file extension for downloaded files

b8020e3

Fix naming convention

13400af

Add error raising for unsupported langauges

3d88380

raileymontalan requested a review from yifanmai September 23, 2024 05:22

yifanmai requested changes Sep 24, 2024


raileymontalan force-pushed the lindsea_pragmatics_scenario_split branch from 0ab8fc3 to 3d88380 Compare September 30, 2024 05:20

Fix nitpicks

8660d1d

raileymontalan requested a review from yifanmai September 30, 2024 06:02

yifanmai approved these changes Oct 1, 2024


yifanmai merged commit 64f23d3 into stanford-crfm:main Oct 1, 2024
6 checks passed


weiqipedia Sep 10, 2024
